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1  Introduction 


Generally  speaking,  mathematicians  and  computer  scientists  are  not  satisfied  with  the  level 
at  which  college  students  understand  math.  Students  have  difficulty  constructing  proofs, 
their  reasoning  abilities  are  inadequate,  and  they  don’t  know  what  “rigor”  means.  Moreover, 
their  fear  of  formalism  and  rigor  impedes  learning  in  science  and  engineering  courses,  as  well 
as  in  math  courses. 

This  paper  describes  a  new  approach  to  teaching  discrete  math  — at  the  freshman  level. 
Our  experience  with  the  approach  leads  us  to  believe  that  it  engenders  a  fresh  emd  positive 
outlook  on  mathematics  and  proof,  addressing  the  problems  mentioned  above. 

A  key  ingredient  of  our  approach  is  an  equational  treatment  of  propositional  and  pred¬ 
icate  calculus  — instead  of  the  natural-deduction  and  Hilbert  styles  that  currently  prevail. 
The  equational  treatment  has  several  advantages. 


1.  It  is  easily  learned,  since  it  builds  on  students’  familiarity  with  equational  reasoning 
from  high-school  algebra. 

2.  After  just  three  days  of  teaching  logic,  problems  can  be  solved  that  would  be  difficult 
without  the  logic,  thus  providing  motivation  for  further  study  and  for  developing  a 
skill. 


3.  The  logic  can  be  applied  rigorously,  without  complexity  and  detail  overwhelming  (a 
criticism  of  other  logics),  so  it  becomes  a  useful  tool. 

4.  The  accompanying  rigid  proof  format  allows  the  introduction  and  discussion  of  proof 
principles  and  strategies,  thereby  giving  students  direction  in  developing  proofs. 


5.  The  logic  can  be  extended  to  other  areas,  including  2dl  those  typically  taught  in  discrete 
math  courses  (e.g.  set  theory  and  a  theory  of  integers). 
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6.  The  problem-solving  paradigm  that  students  see  in  high  school  is  reinforced:  (0)  for¬ 
malize,  then  (1)  manipulate  the  formalization  according  to  rigorous  rules,  and  finally 
(2)  interpret  the  results. 

Our  course  in  discrete  math  starts  with  six  weeks  of  propositional  and  predicate  logic. 
The  emphasis  is  on  giving  students  a  skill  in  formal  manipulation.  Students  see  many 
rigorous  proofs  and  develop  many  themselves,  in  a  rigid  proof  style  (not  in  English).  They 
practice  applying  proof  principles  and  strategies  made  possible  by  the  proof  style.  Along  the 
way,  they  learn  that  attention  to  rigor  may  be  a  simplifying  force  — instead  of  an  onerous 
burden. 

The  unit  on  logic  does  include  a  discussion  of  informal  presentations  of  proofs  typically 
found  in  mathematics  (e.g.  proof  by  contradiction)  and  how  they  are  bjised  in  logic  (e.g. 
proof  by  contradiction  is  based  on  the  theorem  p  =  -^p  =>  false  ).  Thus,  students  see  the 
connection  between  the  informal  proof  outlines  they  see  in  other  courses  and  our  rigorous 
proof  style. 

The  rest  of  our  course  covers  a  variety  of  other  topics  of  discrete  math,  e.g.  set  theory, 
a  theory  of  integers,  mathematical  induction,  relations  and  functions,  combinatorics,  and 
solving  recurrence  relations.  Each  topic  is  presented  in  the  same  rigorous  style  in  which 
logic  was  presented,  by  giving  the  necessary  axioms  for  the  theory  and  building  up  a  library 
of  theorems.  For  example,  set  theory  is  introduced  by  adding  to  pure  predicate  logic  the 
axioms  that  characterize  set  comprehension  and  the  set  operators.  Proofs  about  these  set 
operators  are  then  presented  in  the  same  style  as  before.  Thus,  the  notions  of  proof  and 
proof  style  become  the  unifying  force  in  the  course. 

Many  believe  that  stressing  rigor  turns  students  away  from  math.  Our  experience  is  just 
the  opposite.  Students  take  comfort  in  the  rigor,  because  it  gives  them  a  basis  for  knowing 
when  a  proof  is  correct.  And,  we  find  our  students  applying  what  they  have  learned  to  other 
courses  — before  our  discrete  math  course  ends.  However,  success  in  teaching  logic  and  rigor 
does  require  that  students  have  time  to  digest  the  new  notations  and  to  become  skilled  in 
formula  manipulation. 

The  paper  proceeds  as  follows.  First,  we  compare  our  equational  proofs  with  more 
conventional  approaches.  We  then  turn  (in  Section  3)  to  our  equational  propositional  logic 
and  discuss  how  we  teach  it.  We  follow  this  up  with  a  discussion  of  quantification  and 
predicate  logic.  In  Section  5,  we  show  how  to  use  our  logic  as  the  basis  for  other  topics  in 
discrete  math.  Our  approach  to  logic  meets  resistance  from  those  who  feel  that  meaning 
and  understanding  is  sacrificed  when  formal  manipulation  is  emphasized.  We  address  this 
criticsm  in  Section  6.  We  conclude  in  Section  7  with  some  comments  about  experiences  in 
teaching  the  new  approach. 
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A  comparison  of  proof  methods 


Consider  proving,  from  set  theory,  that  union  distributes  over  intersection: 

(1)  >1  u  (s  n  C)  =  (>i  u  B)  n  u  C)  . 

Most  mathematicians  favor  the  following  kind  of  proof  — at  least  their  texts  contain  such 
proofs.  ^ 

Proof.  VVe  first  show  that  A  U  {B  D  C)  C  (.4  U  fl)  PI  (j4  U  C) .  If  r  e  ^4  U  (B  D  C) , 
then  either  A  or  areBnC.  If  x€j4,  then  certainly  a;  €  A  U  B  and  xi  A  U  C , 
so  X  e  (.4  U  B)  n  (.4  U  C) .  On  the  other  hand,  if  x  e  B  D  C ,  then  x  €  B  and  xeC , 
so  x€  A  U  B  and  x  6  ^4  U  C ,  so  x  €  (>l  U  B)  fi  (.4  U  C) .  Hence,  4  U  (B  D  C)  C 
(4  U  B)  n  (4  U  C) . 

Conversely,  if  y  €  (4  U  B)  n  (4  U  C) ,  then  y  €  4  U  B  and  y  €  4  U  C  .  We  consider 
two  cases:  ye  4  and  yiA.  If  ye  4,  then  ye4  U  (B  n  C),  and  this  part  is  done. 
If  yiA,  then,  since  y  e  4  U  B  we  must  have  yiB.  Similarly,  since  y  e  4  U  C  and 
ye4,  we  have  y^C.  Thus,  yiB  n  C,  and  this  implies  ye4  U  (B  fi  C).  Hence 
(4  U  B)  n  (4  U  C)  C  4  U  (B  n  C) .  theorem  follows.  □ 

This  proof  has  several  shortcomings: 

•  It  is,  unnecessarily,  a  ping-pong  argument  — equality  is  shown  by  proving  two  set 
inclusions. 

•  The  two  case  analyses  complicate  the  argument. 

•  The  use  of  English  obscures  the  structure  of  the  proof. 

•  Some  expressions,  for  example  x  e  4  and  x  e  4  U  B  ,  are  repeated  several  times,  thus 
lengthening  the  proof  unnecessarily.  (Replacing  some  of  the  expressions  with  pronouns 
would  only  make  matters  worse  by  introducing  ambiguity) 

•  The  justifications  for  inferences  within  the  proof  are  not  given;  too  much  is  left  for  the 
reader  to  fill  in.  For  example,  the  proof  says,  “If  x  (?  4  ,  then,  since  x  e  4  U  B  we 
must  have  x  6  B  ” ,  but  no  reference  is  given  to  the  theorem  being  used  to  substantiate 
this  inference. 

•  The  proof  style  is  too  undisciplined  to  invite  useful  discussion  of  proof  strategies  and 
principles.  After  seeing  one  proof  like  this,  students  still  have  difficulty  constructing 
similar  proofs. 

^  Lay  [7,  p.  36]  gives  tliis  proof,  with  a  few  formulas  replaced  by  blanks  for  the  reader  to  fill  in. 
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Some  mathematicians  and  computer  scientists  favor  teaching  Gerhard  Gentzen’s  natural- 
deduction  system  [3]  or  something  similar.  They  argue  (correctly)  that  Gentzen  developed 
natural  deduction  in  order  to  formalize  how  mathematicians  argue  in  English.  However, 
formalizing  contorted  and  difficult  English  arguments  is  of  dubious  utility  — the  contortions 
and  difficulties  remain.  VVe  favor  proof  methods  that  allow  us  to  bypass  problems  introduced 
by  the  use  of  imprecise  and  unwieldy  natural  language. 

Note  that  we  do  not  advocate  altogether  avoiding  English,  informality,  ping-pong  ar¬ 
guments,  or  case  analysis.  They  should  be  used  when  they  lead  to  proofs  that  are  more 
accessible  to  the  reader.  Too  often,  however,  they  are  used  to  ease  the  task  of  the  writer  at 
the  expense  of  the  reader,  when  a  more  mathematical  style  would  be  more  effective. 

Having  criticized  the  traditional  proof  style,  we  now  illustrate  a  better  one,  by  giving  our 
own  proof  that  union  distributes  over  intersection.  Our  treatment  of  set  theory  extends  an 
equational  logic.  For  this  discussion,  then,  we  assume  that  theorems  of  propositional  logic 
and  pure  predicate  logic,  like  the  following  one,  have  already  been  proved. 

(2)  p  V  (9  A  r)  =  (p  V  9)  A  (p  V  r) 

Equality  of  sets  is  defined  using  the  axiom  of  Extensionality  (we  describe  our  notation  for 
quantification  in  Section  4). 

(3)  Extensionality:  S  =  T  =  (Vu|:v€S  =  veT) 

Further,  union  and  intersection  of  sets  are  defined  by  membership  tests. 

(4)  Vi  B  U  C  =  V  i  B  V  V  i  C 

(5)  Vi  B  n  C  =  Vi  B  A  viC 

Note  that  set  membership,  set  equality,  and  other  set  operators  are  defined  axiomatically, 
and  in  a  way  that  is  useful  in  formal  manipulation  — not,  as  is  traditional,  by  an  informal 
model  of  evaluation. 

We  now  prove  theorem  (1):  union  distributes  over  intersection. 

Proof.  By  Extensionality  (3),  we  can  prove  (1)  by  showing  that  an  arbitrary  element  v  is 
in  the  LHS  of  (1)  exactly  when  it  is  in  the  RHS: 

t;  €  AU  (B  nC) 

=  (Definition  of  U  (4)} 

Vi  A  V  Vi B  n  C 
=  (Definition  of  D  (5)) 

Vi  A  V  {vi  B  A  ViC) 

=  (Distributivity  of  V  over  A  (2)) 

{viAVviB)  A  {viAyviC) 

=  (Definition  of  U  (4),  twice) 
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(veyluB)  A  (t)€>luC) 

=  (Definition  of  fl  (5)) 
i;  €  (/I  U  5)  n  (A  U  C) 

This  proof  uses  substitution  of  equals  for  equals  (or  “Leibniz”,  as  we  call  it)  as  its  main 
inference  rule,  as  well  as  transitivity  of  equality  to  link  steps.  College  freshmen  are  familiar 
with  these  rules  from  high-school  algebra  and  therefore  become  comfortable  with  the  proof 
style  rather  quickly. 

The  proof  has  the  following  nice  properties; 

•  All  arguments  are  e.xplicit  and  rigorous;  With  each  step,  a  hint  between  the  two  equal 
terms  gives  the  theorem  that  indicates  why  the  two  terms  are  equal. 

•  The  proof  is  simple  and  direct.  Its  simplicity  accentuates,  rather  than  hides,  the 
relation  between  disjunction  and  union  and  between  conjunction  and  intersection. 

•  The  strategy  used  in  the  proof  is  easy  to  identify  and  teach;  To  prove  something 
about  operators  (here,  U  and  fl ),  use  their  definitions  to  eliminate  them,  perform 
some  manipulations,  and  then  use  their  definitions  to  reintroduce  them. 

Now  consider  a  theorem  whose  proof  involves  quantification  (even  though  the  statement 
of  the  theorem  does  not).  At  the  recent  AMS  meeting  in  Cincinnati,  a  mathematician 
demonstrated  a  computer  program  for  helping  in  the  development  of  proofs.  The  program’s 
interface  was  elegant  and  easy  to  use,  but  the  underlying  proof  system  was  not.  The  logic 
wris  natural  deduction,  augmented  with  definitions  from  set  theory.  The  mathematician 
demonstrated  his  system  with  a  proof  of 

(6)  ACC  ^  BCC  =>  AU5CC  . 

Because  the  definition  of  C  is  in  terms  of  universal  quantification, 

(7)  B  C  C  =  (Vx  I  I  e  fl  ;  X  e  C)  , 

his  formal  proof  required  several  levels  of  nesting  (there  were  proofs  within  proofs  within 
proofs),  as  well  as  case  analysis.  The  proof  was  not  easy  to  follow  or  develop.  Below  is 
our  simpler,  equational  proof.  It  happens  to  use  the  same  proof  strategy  as  the  previous 
equational  proof;  eliminate  an  operator,  manipulate,  and  reintroduce  the  operator. 

ACC  A  BCC 

—  (Definition  of  C  (7),  twice) 

(Vx  I  X  €  A  ;  X  e  C)  A  (Vx  |  x  €  fl  ;  x  e  C) 

=  (Split  range) 

(Vx  |x€AVx€fl;xeC) 
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=  (Definition  of  U  (4)} 

(Vr  \  X  i  A  \J  B  ■.  X  ^  C) 

=  (Definition  of  C  (7)} 

AU  BCC 

An  equivalence  has  been  proved,  instead  of  implication  (6)!  Extending  the  speaker’s  proof 
to  justify  this  equivalence  would  double  the  length  of  his  proof,  because  natural  deduction 
is  being  used. 

In  the  two  examples  presented,  the  equational  approach  is  far  superior  to  more  conven¬ 
tional  approaches.  There  is  much  less  writing,  the  proofs  are  absolutely  rigorous  yet  simple, 
and  the  proofs  are  easy  to  digest  and  repeat  to  others.  All  one  has  to  know  is  an  equational 
version  of  the  pure  predicate  calculus  and  the  few  definitions  of  set  theory. 

Were  these  just  isolated  instances  of  the  superiority  of  the  equational  approach,  there 
would  little  reason  to  discuss  it.  However,  in  our  experience,  the  equational  approach  ex¬ 
hibits  these  advantages  in  all  topics  that  are  traditionally  included  in  a  first-semester  discrete 
math  course  — set  theory,  theory  of  integers,  modular  arithmetic,  mathematical  induction, 
relations,  functions,  combinatorics,  solving  recurrence  relations,  and  modern  algebra.  This 
should  not  be  surprising,  since  logic  is  the  glue  that  binds  together  arguments  in  all  domains. 
Moreover,  bringing  this  glue  to  the  fore  provides  the  unifying  theme  that  has  been  missing 
for  so  long  in  discrete  math  courses. 


3  Equational  propositional  logic 

We  now  outline  our  equational  propositional  logic  and  discuss  teaching  it.  The  logic  has 
the  same  theorems  as  any  conventional  propositional  logic;  the  difference  is  in  the  inference 
rules,  the  axioms  (and  the  order  in  which  they  are  introduced),  and  the  definition  of  a  proof. 
For  example,  equivalence  (i.e.  equality  over  the  boolean  domain)  plays  a  prominent  role  in 
our  logic;  in  comparison,  it  is  a  second-class  citizen  in  most  other  propositional  logics,  where 
implication  dominates. 

We  profit  from  using  two  different  symbols  for  equality.  Generally  speaking,  the  expres¬ 
sion  6  =  c  is  defined  as  long  as  b  and  c  have  the  same  type  — e.g.  both  booleans,  both 
integers,  both  set  of  integers,  both  graphs.  Further,  equality  =  is  treated  conjunctionally; 
b  =  c  =  d  is  an  abbreviation  for  b  =  c  A  c  =  d . 

Symbol  =  is  used  only  for  equality  over  the  booleans:  b  =  c  is  the  same  as  5  =  c  if 
b  and  c  are  boolean.  Symbol  =  is  assigned  a  lower  precedence  than  =  ;  this  allows  us  to 
eliminate  some  parentheses.  For  e.xample,  we  can  write 

X  =  y  y  X  <  y  =  X  <y  . 

(Note  how  spacing  is  used  to  help  the  reader  with  precedences.)  Formal  manipulation  will 
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be  used  often,  and  we  need  ways  to  keep  formulas  simple. 

A  more  important  benefit  arises  from  having  two  different  symbols  for  equality;  we  can 
make  use  of  the  associativity  of  equality  over  the  booleans.  Thus,  we  write  b  =  c  =  d 
for  either  6  =  (c  =  d)  or  (6  =  c)  =  d,  since  they  are  equivalent.  Had  we  used 
only  =  for  equality,  we  could  not  have  benefited  from  associativity  because  a  =  b  =  c 
already  means  a  =  b  A  b  =  c .  As  with  arithmetic  manipulations,  we  often  use  symmetry 
(commutativity)  and  associativity  of  operators  without  explicit  mention.  Logicians  have 
not  made  use  of  the  associativity  of  =  .  For  example,  in  [8],  Rosser  uses  =  conjunctionally 
instead  of  associatively.  Perhaps  this  is  because  logicians  have  been  more  interested  in 
studying  rather  than  using  logic. 

The  three  inference  rules  of  our  equational  logic  are  given  below,  using  the  notation 
Efv  :=  P]  to  denote  textual  substitution  of  expression  P  for  free  occurrences  of  variable  v 
in  expression  E : 

P  =  Q 

Leibniz:  — ; - ^ 

E[v  :=  P]  =  E[v  :=  Q] 

■  P  =  Q,Q  =  R 

Transitivity:  - - - - - 

P  ~  li 

P 

Substitution:  —7 - 

P[v  :=  Q] 

A  theorem  of  our  propositional  calculus  is  either  (i)  an  axiom  or  (ii)  a  boolean  expression 
that,  using  the  inference  rules,  is  proved  equal  to  an  eixiom  or  a  previously  proved  theorem. 
We  also  have,  as  a  metatheorem,  that  P  =  Q  is  a  theorem  iff  P  can  be  transformed  to 
Q  (or  Q  to  P )  using  these  inference  rules. 

Our  proofs  generally  follow  the  format  of  the  equational  proofs  given  earlier  (although 
there  are  some  extensions).  With  this  proof  format,  uses  of  inference  rules  can  be  left 
implicit  — there  is  no  need  to  mention  the  rules.  Contrast  this  with  natural  deduction  and 
Hilbert-style  logics,  where  the  plethora  of  inference  rules  dictates  the  explicit  mention  of 
each  use  of  a  rule. 

Inference  rule  Leibniz  is  used  to  infer  an  equality;  its  use  is  indicated  in  proofs  as: 

E[v  :=  P] 

=  {P  =  Q) 

E[v  :=  Q] 


^  As  another  simplification,  we  write  the  application  of  a  function  f  to  &  simple  argument  6  as  f.b  . 
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Inference  rule  Transitivity  is  used  without  mention  in  the  usual  fashion;  for  example,  it  is 
transitivity  that  allows  us  to  conclude  A  U  (B  C\  C)  =  (A  U  fl)  D  (y4  U  C)  in  the  proof 
on  page  4.  And.  inference  rule  Substitution  is  used  to  generate  an  instance  of  a  theorem 
that  is  to  be  the  premise  of  an  application  of  Leibniz.  For  example,  in  the  step 

p  A  q  Ar 

=  {pAq  =  9  A p,  with  q:=qAr) 
q  A  r  A  p 

the  premise  of  the  instance  of  Leibniz  that  is  being  used  is  pAq  =  q  Ap  with  q  :=  q  Ar  , 
i.e.  p  A  (g  A  r)  =  (g  A  r)  A  p .  Often  in  our  proofs,  Substitution  is  unmentioned  when  its 
use  is  obvious. 


Table  1:  Axioms  of  Equational  Logic 

(8)  Associativity  of  =  :  ((p  =  g)  =  r)  =  (p  =  (g  =  r)) 

(9)  Symmetry  of  =  :p  =  g  =  g  =  p 

(10)  Identity  of  = :  true  =  g  =  g 

(11)  Definition  of  false  ;  false  =  ~<irue 

(12)  Distributivity  of  ->  over  =:-i(p  =  g)  =  -ipsg 

(13)  Definition  of  ^:{p^q)  =  ->(p  =  q) 

(14)  Symmetry  ofV:pVg  =  gVp 

(15)  Associativity  of  V :  (p  V  g)  V  r  =  p  V  (q  V  r) 

(16)  Idempotency  of  V  ;  p  V  p  =  p 

(17)  Distributivity  of  V  over  =:pV(g  =  r)  =  pVg  =  pVr 

(18)  Excluded  Middle;  p  V  -<p 

(19)  Golden  rule:  pAg  =  p  =  g  =  pVg 

(20)  Definition  of  Implication:  p^g=pVg  =  g 

(21)  Consequence:  p  <=  q  =  q  =>  p 


The  axioms  of  our  equational  logic  are  given  in  Table  1,  ordered  and  grouped  as  we  teach 
them.  Equivalence  is  introduced  first.  And,  because  the  first  axiom  says  that  equivalence 
is  associative,  thereafter  we  eliminate  parentheses  from  sequences  of  equivalences.  For  ex¬ 
ample,  Symmetry  of  =  (9)  is  given  as  p  =  g  =  g  =  p.  Associativity  of  =  allows  us  to 
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parse  (9)  in  five  ways,  thus  reducing  the  number  of  axioms  and  theorems  that  have  to  be 
listed:  ((p  =  9)  =  g)  =  p ,  (p  =  g)  =  (9  =  p) ,  (p  =  (g  =  g))  =  p ,  p  =  ((g  =  g)  =  p) ,  and 
p=  (g  =  (g  =  p)) . 

Our  definition  of  conjunction  is  called  the  Golden  rule  (19).  To  see  that  (19)  is  valid, 
check  its  truth  table  or  else  use  associativity  and  symmetry  to  rewrite  it  as 

p=g  =  pAg=pVg  . 

Now,  it  may  be  recognized  as  the  law  that  says  that  two  booleans  are  equal  iff  their  con¬ 
junction  and  disjunction  are  equal. 

Operator  <=  (see  (21))  is  included  because  some  proofs  are  more  readily  developed  or 
understood  using  it  rather  than  ^  .  As  an  example,  we  prove  that  any  integer  divides  itself, 
i.e.  c|c  holds,  where  ]  is  defined  by 

b\c  =  {3d\:b-d=  .)  . 

Note  that  the  proof  format  below  has  been  extended  to  allow  =>  or  to  appear  in  place 
of  =  and  that  part  of  the  proof  is  given  in  English. 

Proof.  We  calculate: 
c|c 

=  (Definition  of  | ) 

(3d|:c-d  =  c) 

^  ( 3-Introduction  —  P[x  :=  E]  ^  (3i  \  :  P)) 

C‘l  =  c 

—  (Multiplicative  identity) 

true 

Hence,  true  ^  c|c.  By  Left  Identity  of  =>  (i.e.  P  =  true=>P),  c|c  is  a  theorem.  □ 

Were  <=  not  available,  this  proof  would  have  to  be  presented  in  reverse,  as  shown  below. 
But  this  second  proof  is  difficult  to  follow.  The  first  few  formulas  are  rabbits  pulled  out  of  a 
hat.  Why  start  with  true?  Where  did  the  insight  come  from  to  replace  true  by  c-1  =  c? 
A  goal  in  writing  a  proof  should  be  to  avoid  rabbits,  to  have  each  step  guided  or  even  forced 
by  the  structure  of  the  formulas,  so  that  an  experienced  reader  would  say,  “Oh,  I  would 
have  done  that  too.”  A  proof  strategy  that  helps  in  this  regard  is:  Start  with  the  more 
complicated  side  of  an  equivalence  or  implication  and  use  its  structure  to  help  guide  the 
proof. 


true 

(Multiplicative  identity) 
c- 1  =  c 
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Table  2:  Portia’s  Suitor’s  Dilemma 

This  story  is  a  take-off  on  a  scene  in  Shakespeare’s  Merchant  of  Venice.  Portia  has  a 
suitor,  who  wants  to  marry  her.  She  does  not  know  whether  she  wants  to  marry  him. 
So,  she  gives  him  a  puzzle  to  solve.  She  gets  two  boxes,  one  gold  and  the  other  silver, 
and  puts  her  portrait  into  one  of  them.  On  the  gold  box  she  writes  the  inscription  “The 
portreut  is  not  here.”  On  the  silver  box,  she  writes,  “Exactly  one  of  these  two  inscriptions 
is  true.”  She  tells  the  suitor  that  the  inscriptions  may  be  true  or  lalse,  she  won’t  say 
which,  but  if  he  can  determine  which  box  contains  the  portrait,  she  will  marry  him. 

=>  ( 3-Introduction) 

(3d|;c-d  =  c) 

=  (Definition  of  | ) 

cjc 

Teaching  preliminary  concepts  (e.g.  textual  substitution)  and  the  propositional  calculus 
takes  us  at  least  eleven  50-minute  lectures.  Students  are  shown  various  proof  principles  and 
strategies,  are  given  many  proofs,  and  are  asked  to  prove  many  theorems  themselves.  The 
emphasis  is  on  achieving  familiarity  with  the  notation  and  theorems  and  on  gaining  a  skill 
in  formal  manipulation. 

Here  is  a  list  of  proof  principles  and  strategies  that  can  be  taught  with  our  approach. 
They  are  not  obvious  to  mathematical  novices,  and  discussing  them  can  be  enlightening. 

•  Principle.  Structure  proofs  to  avoid  repeating  the  same  subexpression  on  many  lines. 

•  Principle.  Structure  proofs  to  minimize  the  number  of  “rabbits  pulled  out  of  a  hat” 
— make  each  step  obvious,  based  on  the  structure  of  the  expression  and  the  goal  of 
the  manipulation. 

•  Principle.  Lemmas  may  provide  structure,  bring  to  light  interesting  facts,  and  ulti¬ 
mately  shorten  a  proof. 

•  Strategy.  To  prove  something  about  an  operator,  eliminate  it,  manipulate  the  result, 
and  then,  if  necessary,  reintroduce  the  operator. 

•  Strategy.  The  shape  of  an  expression  can  focus  the  choice  of  theorems  to  be  used  in 
manipulating  it.  Thus,  identify  applicable  theorems  by  matching  the  structure  of  the 
expression  and  its  subexpressions  with  the  theorems. 

•  Strategy.  To  prove  P  =  Q  ,  transform  the  term  with  the  most  structure  (either  P 
OT  Q)  into  the  other.  The  idea  here  is  that  the  one  with  the  most  structure  provides 
the  mos.  insight  into  the  next  Leibniz  transformation. 
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Table  3: 

Informal  Proof  Techniques 

Informal  proof  technique  Basis  for  the  informal  technique 

Case  analysis 

(p  V  fl)  A  (p  =>  r)  A  (g  ^  r)  =>  r 

Mutual  implication 

p  =  q  =  (p  =>  9)  A  (9  =>  p) 

Contradiction 

->p  false  =  p 

Contrapositive 

p  z>  q  =  -19  =>  -<p 

Upwards  of  80  theorems  of  the  propositional  calculus  can  be  used  as  examples  and 
exercises  in  illustrating  these  proof  principles  and  strategies.  Students  are  not  asked  to 
memorize  all  80-odd  theorems  — since  mathematicians  don’t.  Rather,  students  always  have 
on  hand  a  list  of  the  theorems  in  the  order  in  which  they  can  be  proved.  Through  continual 
use,  students  begin  to  know  many  of  these  theorems  as  their  formal  friends.  On  a  test,  the 
list  of  theorems  is  distributed  and  students  are  asked  to  prove  certain  ones. 

We  intersperse  the  study  of  propositional  calculus  with  interesting  applications.  Many 
seemingly  difficult  word  problems  can  be  solved  easily  through  formalization  and  manipula¬ 
tion,  so  here  is  where  students  begin  to  see  that  they  are  learning  a  new,  powerful,  mental 
tool.  One  interesting  example  is  Portia’s  Suitor’s  Dilemma  (see  Table  2).  It’s  solution  is 
amazingly  simple  using  our  equational  logic  when  one  formalizes,  manipulates,  and  then 
interprets.  Its  solution  in  natural  deduction  is  much  harder  (see  [9,  6]  for  a  discussion). 

Another  interesting  problem  is  to  make  sense  of  the  following  sentence:  “For  every  value 
of  array  section  6[1..9] ,  if  that  value  is  in  array  section  6[21..25] ,  then  it  is  not  in  6[21..25] .’’ 
This  sentence  may  seem  contradictory,  but  formalizing  it,  simplifying  the  formal  statement, 
and  then  interpreting  the  result  yields  a  surprising  answer. 

Our  study  of  equational  logic  is  completed  with  a  look  at  how  conventional,  seemingly 
informal,  proof  techniques  in  mathematics  are  based  on  theorems  of  propositional  calculus 
(see  Table  3).  Several  proofs  are  given  in  both  an  informal  and  the  equational  style  and 
compared. 

4  Quantification  and  the  predicate  calculus 

The  treatment  of  quantification  in  our  course  unifies  what,  until  now,  has  been  a  rather 
chaotic  topic.  Quantification  in  mathematics  assumes  many  forms,  for  example: 

=  12  +  22 -(-32 

(Vi).l  <  X  <  3  =>  6[a:]  =  0  =  6[1]  =  0  A  6[2]  =  0  A  6[3]  =  0 
(3z).l  <  X  <  3  A  b[x]  =  0  =  6[1]  =  0  V  6[2]  =  0  V  6[3]  =  0 

There  appears  to  be  no  consistency  of  concept  or  notation  here.  Compounding  the  problem 
is  that  students  are  not  taught  rules  for  manipulating  specific  quantifiers  — much  less  general 
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Table  4;  Axioms  for  Quantification 

(23)  Empty  range:  (*x  I  false  :  P)  =  (ti  e  identity  of  * ) 

(24)  One-point  rule:  (*x  I  x  =  E  :  P)  =  P[x  :=  E] 

(25)  Distributivity:  (*x  I  R  :  P)*(*x  I  R  :  Q)  =  (*x  I  R  :  P*Q) 

(26)  Range  split:  Provided  -'(/i  A  5)  holds  or  *  is  idempotent, 

(*x  I  P  V  5  :  P)  =  (*x  I  P  ;  P)  *(**  I  5  :  P) 

(27)  Range  split:  (★x  |  P  V  S  :  P)  *  (*x  I  P  A  5  :  P)  =  (*x  |  P  :  P)  ★  (*x  |  5  :  P) 

(28)  Interchange  of  dummies:  (*x  I  P  :  {-ky  |  Q  :  P))  =  (*y  |  Q  :  (*x  I  P  :  P)) 

(29)  Nesting:  {kx,  y  I  P  A  Q  :  P)  =  (*x  I  P :  (*y  |  Q  :  P)) 

(30)  Dummy  renaming:  (*x  I  P  :  P)  =  (*j/ 1  P[r  :=  y]  :  P[x  :=  y]) 

Note:  The  usual  caveats  concerning  the  absence  of  free  occurrences  of  dummies  in  some 
expressions  are  needed  to  avoid  capture  of  variables.  Further,  some  of  the  axioms  require 
the  ranges  of  quantifications  to  be  finite  or  *  to  be  idempotent. 


rules  that  hold  for  all. 

We  use  a  single  notation  for  all  quantifications.  Let  *  be  any  binary,  associative,  and 
symmetric  operator  that  has  an  identity.  ^  The  notation 

(22)  (ki-.T  I  P.i :  P.i) 

denotes  the  accumulation  of  values  P.i ,  using  operator  ★ ,  over  all  values  i  for  which  range 
predicate  R.i  holds.  T  is  the  type  of  dummy  t ;  it  is  often  omitted  in  contexts  in  which 
the  type  is  obvious.  ®  If  range  P.i  is  true  ,  we  may  write  the  quantification  as  (ki  \ :  P.i) . 

In  the  examples  below,  the  type  is  omitted;  also,  gcd  is  the  greatest  common  divisor 
operator. 

(+1  I  1  <  i  <  3  :  »*)  =  1*  +  2*  +  3^ 

(Ax|3<x<9A  prime.x  :  6[x]  =  0)  =  6[3]  =  0  A  6[5]  =  0 

(  gcd  i  I  2  <  i  <  4  :  f*)  =  2*  gcd  3*  gcd  4^ 

With  the  single  notation  (22)  for  quantification,  we  can  discuss  bound  variables,  scope  of 
variables,  free  variables,  and  textual  substitution  for  all  quantifications,  once  and  for  all. 
Further,  we  can  give  socioms  that  hold  for  all  such  quantifications  (see  Table  4). 

Having  discussed  quantification  in  detail,  we  then  turn  to  pure  predicate  logic  itself. 
This  calls  for  just  a  few  more  axioms  that  deal  specifically  with  universal  and  existential 

^  If  *  is  associative  and  symmetric  but  has  no  identity,  then  instances  of  the  axioms  of  Table  4  that  have 
a  false  range  do  not  hold. 

*  Our  logic  deals  with  types,  but  space  limitations  preclude  a  discussion  of  types  here. 
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Table  5:  Additional  Axioms  for  Predicate  Calculus 

(31)  Trading:  {'ix\  R  :P)  =  (Vx|:  ft  =>  P) 

(32)  Distributivity  of  V  over  V:  P  V  (Vi  \  R  :  Q)  =  (Vx  \  R  :  P  V  Q) 

(33)  (Generalized)  De  Morgan:  (3i  I  P  :  P)  =  -'(Vx  |  R  :  -<P) 


quantification  — see  Table  5.  We  do  bow  to  convention  and  use  V  and  3  instead  of  A  and 
V  . 

Range  R.i  in  notation  (22)  for  quantification  can  be  any  predicate.  For  universal  and 
existential  quantification,  however,  a  range  is  not  necessary.  Nevertheless,  consistency  of 
notation  encourages  us  to  use  a  single  notation,  even  for  universal  and  existential  quantifi¬ 
cation.  Furthermore,  in  many  manipulations,  range  R.i  may  remain  unchanged  while  term 
P.i  is  modified,  and  the  separation  of  the  range  and  term  makes  this  easier  to  see.  Here, 
the  desire  for  ease  of  formal  manipulation  has  dictated  the  choice  of  notations. 

Issues  of  scope,  bound  variables,  etc.,  make  quantification  and  predicate  calculus  far 
more  complicated  than  propositional  calculus.  Some  may  even  feel  that  quantification  is 
too  complicated  for  freshmen  and  sophomores.  However,  many  courses  in  math,  computer 
science,  physics,  and  engineering  require  quantification  in  one  form  or  another,  so  not  teach¬ 
ing  quantification  means  that  students  are  unprepared  for  those  classes.  In  fact,  the  lack  of 
knowledge  of  basic  tools  like  quantification  may  explain  partly  why  students  are  apprehen¬ 
sive  about  mathematics. 

Thus,  we  advocate  teaching  quantification  carefully,  completely,  and  rigorously,  but  in  a 
manner  that  instills  confidence.  We  have  found  that  this  can  be  done. 


5  Examples  of  the  use  of  predicate  calculus 

In  our  course,  equational  logic  serves  as  a  springboard  to  the  study  of  other  topics.  After 
all,  many  topics  in  discrete  math  can  be  viewed  as  extensions  of  pure  predicate  calculus 
— new  types  of  values  are  introduced  and  operations  on  them  are  defined  by  adding  axioms 
to  the  logic.  We  provide  some  brief  glimpses  of  how  this  works. 


Set  theory 

In  this  article,  we  use  conventional  notation  for  set  comprehension,  {x  1  P.x}  ,  but  restrict 
X  to  be  a  bound  (dummy)  variable  (i.e.  not  an  expression).  We  define  membership  in  a  set 
comprehension  as  follows. 

(34)  Set  membership:  Provided  x  does  not  occur  free  in  E  , 
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Ei{x\  P}  =  (3x\  P:x  =  E)  . 

Recall  from  (3)  that  set  equality  is  defined  by  5  =  T  =  (Var  |:  r  6  5  =  x  €  T) . 

Section  2  already  compared  equational  proofs  in  set  theory  to  informal  English,  so  we 
won’t  do  that  again  here.  Instead,  we  discuss  Russell’s  paradox,  which  is  usually  covered  in 
elementary  set  theory.  We  define  Russell’s  paradoxical  set  S  (if  it  is  allowed)  by  its  mem¬ 
bership  test  — remember  that  set  union  and  intersection  were  also  defined  by  membership 
tests. 

(35)  z  €  5  =  X  0  X  for  all  sets  x  . 

S  is  a  set,  and  instantiating  z  with  S  in  (35)  yields 
SeS  =  S«S 

which  is  false  .  An  inconsistency  arose  by  introducing  set  S .  We  conclude  that  S  is  not 
well  defined  and  refuse  to  allow  (35).  That’s  all  there  is  to  it!  There  is  no  need  for  the 
confusing,  English,  ping-pong  argument  that  one  finds  in  many  discrete-math  texts. 

Theory  of  integers 

The  integers  can  be  explored  by  extending  predicate  logic  with  the  new  type  Z ,  giving  the 
axioms  for  operations  on  its  elements,  and  then  proving  various  theorems.  The  same  idea  was 
used  in  introducing  sets,  so  the  students  see  a  pattern  emerging.  Once  the  pattern  is  seen, 
one  need  not  dwell  on  the  proofs  of  all  the  theorems  concerning  addition,  multiplication, 
etc.,  since  the  students  are  already  familiar  with  most  of  the  them. 

One  can  spend  time  on  new  integer  functions  and  operations.  One  example  is  the  greatest 
common  divisor.  We  use  infix  operator  gcd  for  this  and  define  it  as  follows.  (We  write  the 
maximum  of  two  integers  6  and  c  as  6  |  c  and  assume  that  theorems  for  t  and  abs  have 
already  been  taught.) 

(36)  b  gcd  c=(td|d|6Ad|c:d)  (for  6,  c  not  both  0 ) 

0  gcd  0  =  0 

The  first  line  of  (36)  does  not  define  0  gcd  0 ;  since  all  integers  divide  0  ,  0  has  no  maximum 
divisor.  We  define  0  gcd  0  to  be  0 ,  so  that  gcd  is  total  over  Z  x  Z. 

Note  that  t  is  associative  and  symmetric,  so  we  can  use  it  in  quantifications.  Having 
spent  a  great  deal  of  time  on  quantification,  little  has  to  be  said  about  using  T  in  this 
manner.  However,  since  T  over  the  integers  does  not  have  an  identity,  we  must  avoid  using 
quantification  theorems  that  rely  on  an  identity. 

The  next  step  in  exploring  gcd  is  to  state  (and  prove  some  of)  its  properties.  This 
step  has  two  goals:  to  familiarize  the  student  with  the  new  operation  and  to  provide  a  basis 
for  later  manipulations  using  gcd .  Function  gcd  is  symmetric,  is  associative,  satisfies 
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6  gcd  6  =  abs.b  ,  has  1  as  its  zero  (i.e.  1  gcd  6=1),  has  0  as  its  identity  (i.e.  0  gcd  6  =  6), 
and  so  on. 


Let  us  prove  6  gcd  6  =  aba.b .  A  few  points  about  the  proof  given  below  are  worth 
mentioning.  It  contains  a  case  analysis,  because  the  definition  of  gcd  does.  We  try  to 
avoid  case  analysis,  but  it  is  not  always  possible.  Second,  the  proof  is  written  partly  in 
English,  since  that  is  the  easiest  way  to  see  the  case  analysis.  Thus,  we  are  not  completely 
rigid  in  our  proof  style.  However,  the  proof  does  contain  two  equational  subproofs. 

Proof.  There  are  two  cases  to  consider:  6  =  0  and  6^0. 

Case  6  =  0.  0  gcd  0  =  abs.O 

=  (Def.  of  gcd  (36);  Def.  of  a6s) 

0  =  0  — Reflexivity  of  = 

Case  6^0.  6  gcd  6 

=  (Def.  of  gcd  (36)  — 6^0  by  case  assumption) 

(Tdl  d|6  A  d\b:d) 

=  (Idempotency  of  A  ,  p  A  p  =  p ) 

(TdU|6:d) 

=  (The  maximum  divisor  of  6  is  abs.b 
—  6^0  by  case  assumption) 
abs.b 

Mathematical  induction 

Strong  induction  over  the  integers  can  be  defined  by  the  following  axiom,  where  the  dummies 
are  of  type  natural  number. 

Mathematical  Induction: 

(Vn|:P.n)  =  P.O  A  (Vn|:  (V»  I  0  <  i  <  n  :  F.r)  =cP(n4-l)) 

Our  earlier  study  of  quantification  has  given  students  the  manipulative  skills  they  need  to 
prove  formulas  by  induction.  For  example,  proving  n*  =  (Si  I  1  <  i  <  n  ;  2*i  —  1)  requires 
knowledge  of  axioms  and  theorems  for  manipulating  summations. 

Furthermore,  the  rigorous  formulation  of  induction  and  proofs  by  induction  helps  clar¬ 
ify  certain  points  that  ate  usually  confusing.  Here  is  an  example  of  this.  When  proving 
statements  by  induction,  we  always  put  them  in  the  form 

(Vn |:  0  <  r» :  P.n)  where  P.n  :  ... 

Thus,  we  formalize  the  theorem  to  be  proved  and  name  the  induction  hypothesis.  For 
example,  suppose  we  want  to  prove  6"*'''"  =  6"*-6"  for  n,  m  natural  numbers.  Writing  this 
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Table  6:  Equivalence  of  Mathematical  Induction  and  Well-foundedness 

(U,-<)  is  well  founded 
(Definition  of  well-foundedness) 

(V5|:5^0  =  (3x1;  a:  €  S  A  (Vy  I  y -<  I  :  y  «5))) 

{X  =  Y  =  -'X  =  -'V  ;  Double  negation) 

(VS1:S  =  0  =  ->(3x1:  X  e  5  A  (Vy  I  y -<  X  :  y  (?S))) 

(De  Morgan,  twice) 

(V5|:S  =  0  =  (Vx|:x«5  V-(Vy|y^x:y€S))) 

(Change  dummy,  using  P.z  =  z  e S) 

(VP|:(Vxl:P.x)  =  (Vx  I :  P.x  V  ^(Vy  |  y  ^  x  ;  P.y))) 

(Law  of  implication,  p  ^  q  =  ~'P  Y  q) 

(VP  I :  (Vx  I :  P.x)  =  (Vx  I :  (Vy  |  y  X  x  :  P.y)  =>  P.x)) 

(Definition  of  induction  over  {U,  -<) ) 

(U,  ■<)  admits  induction 


formula  as 

(Vn  I  0  <  n  :  P.n)  where  P.n  :  (Vm  |  0  <  m  :  =  6”*  *6”) 

clarifies  what  the  “induction  variable”  is,  by  making  it  am  explicit  argument  of  P .  Here, 
our  style  imposes  a  measure  of  precision  that  is  almost  impossible  to  obtain  with  an  English 
argument.  If  one  does  not  know  how  to  write  quantifications,  it  is  difficult  to  explain  the 
different  roles  of  m  and  n  in  this  proof  by  induction. 

Finally,  the  background  on  predicate  logic  allows  us  to  go  into  mathematical  induction 
more  deeply  than  is  usually  possible  in  a  first  course.  Consider  a  pair  {U,  <) ,  where  U 
is  a  nonempty  set  and  -<  a  binary  relation  over  U  .  When  can  one  use  induction  on 
{U,  -<)  ?  Answering  this  question  rigorously  gives  the  students  a  much  better  understanding 
of  induction  than  has  hitherto  been  achieved. 

The  pair  (£/,  -<)  is  well  founded  iff  every  nonempty  subset  of  U  has  a  minimal  element 
(with  respect  to  -< ).  In  Table  6,  we  prove  that  {U,  -<)  is  well  founded  exactly  when  it  admits 
induction  (the  second  formula  in  the  proof  is  the  formal  definition  of  well-foundedness, 
while  the  penultimate  formula  is  the  definition  of  induction  ^ ).  This  proof  is  so  simple  that 
students  can  be  asked  to  reconstruct  it  on  a  test  — and  95%  of  them  do  so.  Such  performance 
is  impossible  using  a  proof  in  English,  even  though  the  two  proofs  rely  on  the  same  idea  of 
defining  a  predicate  P  in  terms  of  a  set  S  (and  vice  versa).  Students  don’t  memorize  the 
proof  character  for  character;  instead,  they  understand  the  basic  idea  and  develop  the  proof 
afresh  each  time  they  want  to  present  it. 

^  The  formal  definitioiia  of  well  foundedness  and  induction  require  second-order  predicate  c2dculu8. 


16 


Generating  functions 

Solving  recurrence  relations  (homogeneous  difference  equations)  using  generating  functions 
is  sometimes  taught  in  discrete  math.  The  generating  function  G{z)  for  a  sequence  xq  , 
Xi  ,  zj  ,  ...  is  the  polynomial  xo*2°  +  xi‘Z^  +  X2*x^  +  . . . ,  or 

(Ei  1  0  <  i :  xo*2‘)  . 

Thus,  the  generating  function  for  a  sequence  is  just  a  different  representation  of  the  sequence. 

Many  useful  generating  functions  can  be  written  in  a  closed  form,  but  for  the  student 
with  little  knowledge  of  the  axioms  for  manipulating  quantifications,  the  derivation  of  these 
closed  forms  is  a  black  art.  However,  students  with  skill  in  manipulating  quantifications  can 
do  more  than  follow  the  proofs;  they  can  discover  the  closed  forms  themselves,  once  they 
have  been  shown  the  basic  idea. 

To  see  this,  we  now  calculate  the  closed  form  of  the  generating  function  G(z)  for  the 
sequence  c°,c^,c^,...  for  some  nonzero  c.  The  calculation  starts  with  the  definition  of 
the  generating  function.  Thereafter,  the  calculation  unfolds  in  an  opportunistic  or  forced 
fashion:  at  each  step,  the  shape  of  the  formula  guides  the  development  in  an  almost  unique 
way.  We  have  extended  the  hints  of  the  proof  with  comments  that  motivate  each  step. 


G(2)  =  (Et|0<i:c<-2‘) 

(Split  off  term  — this  is  the  simplest  atnd  almost  the  only 
change  possible.) 

G(2)  =  c°-2°  +  (Et  I  l<i:c*-20 

(Arithmetic  — isolate  the  complicated  term) 

G(2)-  1  =  (Ei  I  1  <  i  :c‘-2*) 

(Change  dummy  — obvious  way  to  remove  the  summation 
is  to  use  the  definition  of  G{z)  ;  change  the  range  of  the 
summation  to  make  it  the  same  as  that  of  C  ) .) 

G(z)  -  1  =  (Ei  I  0  <  i :  c‘+‘  •2<+^) 

(Distributivity  — expose  the  RHS  of  definition  of  G(2) ) 
G(z)-  1  =  c-r-(Ei  I  0  <  i  :c'-z') 

(Definition  of  G{z) ) 

G(z)  —  1  =  C'2*G(2) 

(Arithmetic) 

G(z)  =  1/(1-c-2) 


6  Intuition  and  understanding 

The  approach  we  have  presented  has  not,  as  yet,  been  universally  accepted.  The  most 
frequent  criticism  we  hear  is  that  stressing  formal  manipulation  will  impede  the  develop- 
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ment  of  intuition  and  understanding.  Also,  our  critics  fear  that  students  will  lose  track  of 
“meaning”,  or  semantics.  We  now  address  these  issues. 


On  intuition  and  discovery 

By  intuition,  one  usually  means  direct  perception  of  truth  or  fact,  independent  of  any 
reasoning  process;  keen  and  quick  direct  insight;  or  pure,  untaught,  noninferential  knowledge 
( Webster’s  Encyclopedic  Unabridged  Dictionary,  1989).  There  is  simply  no  hope  of  teaching 
this  — how  can  one  teach  something  that  is  untaught,  noninferential,  and  independent  of 
any  reasoning  process?  Of  course,  one  can  hope  that  students  will  develop  ^m  ability  to 
intuit  by  watching  instructors  in  math  courses  over  the  years.  But  this  hit-or-miss  prospect 
cannot  be  called  teaching  intuition. 

On  the  other  hand,  a  good  part  of  mathematics  is  concerned  with  the  opposite  of  intu¬ 
ition;  with  new  and  different  reasoning  processes  that  complement  our  ability  to  reason  in 
English.  This  part  of  mathematics  can  be  taught,  and  our  approach  to  logic  is  an  excellent 
vehicle  for  that  task. 

The  question  may  then  arise  whether  students  can  be  taught  something  about  discovery 
that  does  not  hinge  on  intuition.  Here,  our  syntactic  method  of  proof  outshines  the  more 
conventional  proof  methods  of  reasoning  in  English.  We  are  able  to  teach  aids  to  discov¬ 
ery.  In  particular,  with  our  disciplined,  syntactic,  proof  style,  we  can  teach  principles  and 
strategies  whose  application  can  indeed  lead  to  proofs  in  many  (but  not  all)  cases.  We  have 
yet  to  see  comparable  principles  and  strategies  for  conventional  English  proofs. 

Finally,  in  many  cases,  formalization  and  syntactic  manipulation  can  set  the  stage  for 
discovery.  For  example,  recall  the  earlier  discussion  of  closed  forms  of  generating  functions. 
The  chance  of  intuition  or  English  reasoning  helping  to  discover  closed  forms  is  small,  but 
freshman  and  sophomores  csm  be  taught  to  discover  them  through  syntactic  manipulation. 

Recall  also  the  pigeonhole  principle,  which  for  a  hundred  years  has  been  presented  as; 

If  more  than  n  pigeons  are  placed  in  n  holes,  at  least  one  hole  will  contain 
more  than  one  pigeon. 

We  formalize  it.  Consider  the  bag  (multiset)  5  of  numbers  giving  the  number  of  pigeons 
in  each  hole.  For  example,  if  the  first  two  holes  contain  3  pigeons  each  and  the  third  hole 
1 ,  the  bag  is  5  =  {|1,3,3J.  The  statement  “more  than  n  pigeons  are  placed  in  n  holes” 
is  formalized  as  average.S  >  1 .  The  statement  “At  least  one  hole  contains  more  than 
one  pigeon”  is  formalized  as  maximum.S  >  1 .  Hence,  the  pigeonhole  principle  can  be 
formalized  as 

(37)  Pigeonhole  principle:  average.S  >  1  =►  maximum.S  >  1 . 
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Now,  (37)  should  not  be  surprising,  since  the  average  of  a  set  of  numbers  is  bounded  above 
by  their  maximum.  In  fact,  we  could  take  the  following  as  a  generalized  principle. 

(38)  Generalized  pigeonhole  principle:  average. S  <  maximum.S  . 

Clearly,  (38)  implies  (37)  but  not  the  converse.  And,  there  are  situations  in  which  (38)  can 
be  used,  but  not  (37).  Hence,  formalization  led  to  an  improvement. 


On  meaning  or  semantics 

It  has  also  been  said  that  semantics  or  “meaning”  is  lost  in  our  approach  to  proofs.  The 
English  proof  of  distribution  of  union  over  intersection  on  page  3  is  full  of  meaning,  it  is 
said,  while  our  syntactic  proof  suppresses  meaning. 

However,  an  English  argument  can  be  formalized  in  a  natural-deduction  logic.  The 
resulting  natural-deduction  proof  is  just  as  much  a  syntactic  argument  as  an  equational 
proof,  so  its  English  counterpart  is  just  an  informal  version  of  a  syntactic  proof  (in  which 
inference  rules  are  not  mentioned).  There  is  nothing  “semantic”  about  it. 

Thus,  comparing  proofs  based  on  meaning  or  semantics  is  a  red  herring. 

Let  us  discuss  meaning  or  semantics  in  relation  to  the  introduction  of  an  operator  like 
set  union.  The  operator  can  be  explained  in  several  ways.  One  can  illustrate  set  union 
with  a  Venn  diagram,  define  in  English  what  it  means,  show  how  to  evaluate  a  set  union, 
give  examples,  and  give  a  formal  definition.  Having  different  views  provides  redundancy, 
which  is  helpful  in  promoting  full  comprehension,  and  our  formalistic  approach  to  teaching 
mathematics  does  not  preclude  meaning-ful  discussions  of  alternative  views.  However,  the 
student  should  be  made  to  realize  that  for  purposes  of  reasoning  — constructing  proofs —  it 
is  the  axiomatic  definition  that  is  important.  In  fact,  the  student  should  view  the  axiomatic 
definition  as  encoding  all  the  meaning  of  the  object  being  defined. 


On  understanding 

What  really  is  at  issue  is  the  question  of  understanding:  which  approach,  the  equational  or 
the  conventional  one,  provides  more  understanding? 

A  proof  should  convey  belief  in  the  theorem.  A  proof  provides  evidence  for  belief,  where 
the  evidence  consists  of  the  facts  (e.g.  previously  proved  theorems)  on  which  it  rests  and  on 
how  these  facts  interact  to  convince.  We  understand  the  proof  when  we  understand  which 
facts  are  used  and  how  they  interact.  Full  understanding  also  implies  the  ability  to  explain 
the  proof  to  others  and  perhaps  to  prove  other  theorems  with  similar  proofs. 

Now  look  at  the  English  proof  of  distribution  of  union  over  intersection  on  page  3.  That 
proof  does  not  state  the  facts  on  which  it  rests.  For  example,  it  says,  “If  y(  A  ,  then,  since 
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yiA\jB  we  must  have  y  €  5  ” ,  but  there  is  no  reference  to  the  theorem  that  explains  why 
this  inference  holds.  This  is  typical  of  most  proofs  in  English:  the  underlying  facts  are  left 
to  be  inferred  by  the  reader. 

Second,  in  that  English  proof,  it  is  difficult  to  see  precisely  how  the  f2u;ts  interact.  The 
proof  cannot  be  easily  learned  and  then  explained  to  others.  And,  having  seen  the  proof, 
students  still  have  difficulty  proving  similar  theorems.  In  fact,  showing  the  proof  to  students 
gives  little  insight  into  proofs  in  general. 

Thus,  from  the  view  of  providing  “understanding”,  the  English  proof  is  deficient. 

Now  turn  to  the  equational  proof  on  page  4.  Every  fact  used  in  the  proof  is  stated.  The 
uses  of  the  three  inference  rules  are  given  implicitly  by  the  structure  of  the  proof,  and  the 
premises  of  uses  of  Leibniz’s  rule  are  stated  explicitly.  Thus,  “understanding”  this  proof,  in 
terms  of  the  facts  it  uses  and  their  interaction,  is  almost  trivial.  Further,  the  proof  uses  a 
simple  proof  strategy:  eliminate  operators  using  their  definitions,  perform  manipulations, 
and  reintroduce  the  operators.  Since  the  strategy  is  one  that  is  used  often,  it  can  be  taught 
and  then  applied  by  students  in  many  different  cases.  In  fact,  after  studying  this  proof, 
a  student  should  have  little  difficulty  developing  it  afresh  when  it  is  to  be  explained  to 
someone  else.  The  student  has  full  understanding. 

In  short  the  conventional  proof  method  has  trouble  promoting  full  understanding,  while 
the  equational  proof  method  makes  understanding  easy. 

As  another  example,  look  at  the  proof  of  equivalence  of  well-foundedness  of  {U,  -<)  and 
mathematical  induction  over  (£/,  -<) .  Rarely  do  texts  give  an  English  proof  of  this  equiva¬ 
lence,  because  it  would  be  too  difficult.  The  equational  proof,  however,  is  fully  understood 
by  most  of  our  students.  Further,  the  equational  proof  stunningly  brings  to  light  the  im¬ 
portant  fact  upon  which  the  proof  is  based,  namely  the  ability  to  put  predicates  and  sets  in 
correspondence  using  the  relation  P.z  =  zi  S .  The  corresponding  English  truth,  because 
its  structure  is  so  complex,  hides  this  fact. 


7  Experience  with  our  approach 

In  Spring  1993,  we  taught  a  discrete-math  course  to  70  students  (mainly  freshman  and 
sophomores)  in  the  Computer  Science  Department  at  Cornell,  using  a  draft  of  our  text  A 
Logical  Approach  to  Discrete  Math  (Springer- Verlag,  New  York,  1993)  [5].  This  experience 
showed  that  students: 

•  Lose  their  fear  of  mathematics  and  mathematical  notation. 

•  Gain  a  good  understanding  of  proofs  and  their  development. 

•  Acquire  some  skill  in  formal  manipulation  and  begin  almost  immediately  to  apply  that 
skill  in  other  courses. 
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•  Gain  an  appreciation  for  rigor,  so  much  so  that  they  ask  for  it  in  later  courses. 

Further,  we  found  that  part  of  the  six  weeks  spent  on  logic  could  be  made  up  in  the  latter 
part  of  the  course,  because  students  understood  the  later  material  more  easily.  We  also  feel 
that  material  in  subsequent  courses  can  be  taught  more  effectively  and  efficiently  because 
students  are  better  prepared,  although  we  have  no  firm  evidence  on  this  yet. 

Near  the  end  of  our  discrete-math  course,  we  asked  the  students  to  write  a  short  essay 
explaining  what  they  had  learned.  Five  wrote  negative  comments,  but  the  other  65  were 
overwhelmingly  positive.  This  is  the  first  time  we  experienced  or  heard  of  an  overwhelmingly 
enthusiastic  response  to  a  discrete  math  course.  All  their  comments  are  summarized  in  the 
first  chapter  of  the  Instructor’s  Manual  [6],  where  the  gestalt  of  the  course  is  discussed.  We 
repeat  a  few  comments  here. 

This  course  was  really  groovy,  perhaps  my  favorite  one  ....  I  am  a  math  major 
who  never  thought  I  would  enjoy  doing  proofs,  until  taking  this  course. 

Stressing  logic  has  helped  my  methods  of  thinking.  I  definitely  have  a  better 
understanding  of  proof. 

In  my  linear  algebra  class,  !  understand  proofs  with  much  less  difficulty,  and 
proving  theorems  about  matrices  has  become  easier  as  well. 

My  math  homework  from  last  week  ended  with  two  proofs,  and  I  was  happy 
about  it! 

There  are  real  applications  out  there  for  predicate  and  propositional  calculus 
and  an  observable  benefit  to  formalism. 

I  really  enjoyed  . . .  [this  course],  because  of  the  rigor  and  techniques  of  proof  I 
learned. 

The  class  dissolved  my  fear  of  proving  things,  and  I  find  proofs  easier  to  tackle 
now. 

To  be  perfectly  honest,  I  enjoyed  it,  which  is  surprising  considering  I  do  not 
enjoy  math. 

This  course  taught  me  the  most  about  how  — to  think  logically —  in  all  the  many 
courses  I  have  taken  in  my  Cornell  career. 

A  draft  of  text  [5]  was  also  used  in  Fall  1993  by  Sam  Warford  at  Pepperdine,  who 
considered  his  course  “a  big  success” . 

We  hope  that  others  will  try  out  this  new  approach  to  teaching  math.  We  think  that 
the  approach  produces  students  with  far  better  math  skills  and  that  it  should  be  integrated 
into  all  math  curricula.  In  fact,  a  course  for  non-majors  based  on  this  approach  could  serve 
as  a  replacement  for  calculus. 
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tion,  required  too  much  formal  detail  and  gave  no  insight.  The  equational  approach  was 
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